Dataverse Replication Package

Purpose
This package contains a cleaned, replication-oriented set of analysis scripts.


Package structure
- 01_run_all.R
    Master script that sources the individual analysis scripts.
- R/
    Individual analysis scripts, one per analysis.
- data/
    Place to store the input data files.
- outputs/
    Scripts write output tables and text summaries here.

Data files
The scripts intentionally use placeholders instead of fixed file names:
    "data/change to data file name"

Before running, replace the placeholder in each script with the actual file name.
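
A small check at the top of a script can catch a placeholder that was left unreplaced. The sketch below is only an illustration: check_data_file() and data_file are hypothetical names and are not part of the package scripts.

```r
# Hypothetical helper (not part of the package): stop early if the
# placeholder was not replaced or the file cannot be found.
check_data_file <- function(data_file) {
  if (grepl("change to data file name", data_file, fixed = TRUE)) {
    stop("Replace the placeholder with the actual data file name.")
  }
  if (!file.exists(data_file)) {
    stop("Data file not found: ", data_file)
  }
  invisible(data_file)
}

# Example: the path still holds the placeholder, so the check reports an error.
data_file <- "data/change to data file name"
result <- tryCatch(
  check_data_file(data_file),
  error = function(e) conditionMessage(e)
)
print(result)
```

With a real path in data_file, the helper returns the path invisibly and the script can continue.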

Expected inputs by script
- R/02_disfluency_tc_main_permutation.R
    CSV with columns:
    participant_id, tc, disf_loc
    Optional extra columns allowed.
    Uploaded file: final_cleaned_disfluency_tc.csv

- R/03_disfluency_tc_cellwise_residuals.R
    Same input structure as script 02.

- R/04_within_clause_group_comparison.R
    Table with one row per participant and columns:
    group, prop_w
    where group contains novice/expert or novice/therapist labels.

- R/05_silence_proportion_group_comparison.R
    Raw or processed dataset that can be summarized to one row per participant with columns:
    participant_id, group, text
    where text contains disfluency labels sp and fp.
    This script assumes long-format input similar to that used in the working code.

- R/06_tc_group_permutation.R
    Raw think-aloud spreadsheet with columns similar to:
    tier, text, ID, ניסיון מקצועי (Hebrew for "professional experience")
    Excel file named: TAdisfluencyDataupload_8.3.26

- R/07_relisten_group_comparison.R
    Excel file with relisten counts: relisten data.xlsx
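
Before running a script, it can help to confirm that the input has the columns listed above. The following is a minimal sketch in base R; missing_columns() is a hypothetical helper, not part of the package, and the toy data frame contains made-up values only.

```r
# Hypothetical helper (not part of the package): report which required
# columns are absent from a data frame. Extra columns are ignored.
missing_columns <- function(df, required) {
  setdiff(required, names(df))
}

# Toy data standing in for the script 02 input; the values are made up.
toy <- data.frame(
  participant_id = c("p01", "p02"),
  tc = c("a", "b"),
  note = c("x", "y")
)

# Columns expected by script 02 according to this README.
missing <- missing_columns(toy, c("participant_id", "tc", "disf_loc"))
print(missing)
```

For a real file, the same helper could be applied to the result of readr::read_csv() or readxl::read_excel(), using the column names listed for the relevant script.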
    

How to run
1. Put your input data files in the data/ folder.
2. Open each script and replace:
       "data/change to data file name"
   with the correct file name.
3. Run:
       source("01_run_all.R")
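
For orientation, a master script of this kind could look roughly like the sketch below. This is an assumption about the general pattern, not the contents of 01_run_all.R; run_all() is a hypothetical name, and the real script may source the files in a different way.

```r
# Hypothetical sketch: source every .R file in a scripts folder in name order.
run_all <- function(script_dir = "R") {
  scripts <- sort(list.files(script_dir, pattern = "\\.R$", full.names = TRUE))
  for (s in scripts) {
    message("Running ", s)
    # Each script is evaluated in its own environment so objects do not leak
    # between analyses (a design choice of this sketch, not of the package).
    source(s, local = new.env())
  }
  invisible(basename(scripts))
}
```

Calling run_all("R") from the package root would then run the numbered scripts in order.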

Required R packages
dplyr
tidyr
readr
readxl
stringr
coin
boot
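
The following sketch checks which of the packages listed above are not yet installed. The variable names are illustrative, and the install step is left commented out so that nothing is installed without the user choosing to do so.

```r
# Packages listed in this README.
required_pkgs <- c("dplyr", "tidyr", "readr", "readxl", "stringr", "coin", "boot")

# Names of listed packages that are not currently installed.
installed <- rownames(installed.packages())
to_install <- setdiff(required_pkgs, installed)

if (length(to_install) > 0) {
  message("Missing packages: ", paste(to_install, collapse = ", "))
  # Uncomment to install from CRAN (requires an internet connection):
  # install.packages(to_install)
} else {
  message("All required packages are installed.")
}
```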

